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Abstract 



This report suramarizes the research conducted at the Ins titute , 
under support of the National Science Foundation, concerning technical 
problems in implementing university-level computer-assisted instruction 
in mathematics and science. Two areas of research have been pursued: 

1) Informal laathenatical procedures * The 
development of sophisticated and efficient methods for 
students to use in interactive proof procedures. In 
particular, the research has concentrated on providing 
both low-level (e.g., ^intelligent' typing aids) and 
high-level (e.g., proof strategy mechanisms) procedures 
to enable students to follow standard mathematical 
practice as far as possible. 

2) Audio and prosodic features . Audio research has 
concentrated on design considerations for the Mini-MISS 
machine, especially the problem of using interpolation 
of linear predictive coding of speech to achieve a low 
transmission rate without sacrificing speech quality. 
Also, an on-line prosodic generation procedure was 
implemented, and, improvements to the syntactic analysis 
and lexicon were made. Finally two studies of the MISS 
prosodic quality were carried out. 

In addition to discussion of theje topics, this report includes a 
full bibliography of presentations and publications supported by NSF 
during the first two years of this grant. 
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Int roclticc Ion 



This report aummarizQa the research conducted at the Institute Eor 
Mathematical Studies in the Social Sciences (IMSSS) at Stanford, under 
support o£ the National Science Foundation, concerning technical 
problems in implementing university-level computer-assisted instruction 
(CAI) in mathematics and science. We have pursued research in two areas 
relating to CAI: informal mathematical procedures and the use o£ audio 
messages with prosodic features. Our overall objective has been to 
research and develop the use of computers for instructional purposes. 
An overview of this project, together with a review of other CAI 
projects can be found in Suppes (1979). 

In the area of informal mathematical procedures, we have developed 
sophisticated and efficient methods by which students can use 
interactive proof procedures in courses such as "Introduction to logic" 
and "Set theory". Part of the effort during this grant year has been 
devoted to modularizing and streamlining the programs that present and 
control the student courses and providing these courses with useful low- 
level aids such as command and formula recognition, on-line help and 
editing. A fuller discussion of the streamlining and student aids is 
given in Section 2.1. 

Several different high-level procedures have been developed during 
this grant year. These procedures share a common goal of making it 
possible for the student to follow standard mathematical practice in 
doing proofs with the result that more of the routine work is shifted 
from the student to the computer. A procedure to execute proof 
strategies that reduce problems and goals to subproblems and subgoals is 
discussed below, in Section 2^:?. Guidance to students attempting to 
solve problems is provided by utiliEing the theorem prover as a 
knowledge base (see Section 2.3). A more sophisticated inference 
procedure was also added during this year (Section 2.4). 

A new exercise type has been added to the curriculum during this 
grant year. This exercise type directs the student to provide finite 
objects satisfying given (logical) conditions. The structure of the 
exercise allows the computer to provide some automatic explanation of 
wrong answers- We discuss such exercises below. Section 2.5. 

In the area of audio and prosodic features, a major avenue of 
research has been the design considerations for the Mini-MISS machine , 
especially the problem of using interpolation of linear predictive 
coding of speech to achieve a low transmission rate without sacrificing 
speech quality. Section 3.1 discusses these studies and an experiment 
we are conducting to determine the perceptual effect of a particular 
interpolation model. 

Another focus of audio research has been on the improvement of the 
linguistic analysis and prosodic generation algorithms. A simple model 
of a "story-^view" analysis of dialogue is being extended to generate 
paragraph prosodic contours (see Section 3.2.1). Improvements to the 
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ayncnctic annlyaia nncl pronodlc RGnernnian routines to utUUo Bcamnnttc 
Inalghca have been Implemented (Section 3.2,2). An on-line proaoclic 
Roneration procedure waa implemented (see Section 3.2.3), provldinp; 
the capability for production of audio moaanReH generated during 
execution of the curriculum. 

Two studies of the MISS prosodic quality were carried out. The 
first study examined the relative contributions to unnaturalness of 
different components of the method of word concatenation with prosodic 
manipulation as implemented In MISS (Section 3.3,l), The second 
evaluated prosodic quality by examining student use of audio In the 
course "Introduction to Logic" at Stanford University (Section 
3,3. 2). Further Improvements to the audio generation are being 
planned In response to the results of these studies. 

A substantial effort has been made to disseminate research results 
as described In this report, and engage In dialogue with other research 
groups. Project members have presented and published papers and 
lectures, and participated In panels dealing with both the theory and 
applications of research by us and others. This report Includes a full 
bibliography (Appendix A) of these presentations and publications, 
supported by NSF, during the first two years of this grant. 
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Tn[:nrma I Madhamat Lcnl ProcarlnreH 



The work In Informal TTiathamat: 
iig deacrlbecl In t:hG noction on 
proposal (Suppofi (1977), p^ 82). 
nnd deacrlbe tho work accompllahed 



Lcnl proceduroH Ui boinp; cnrrlQfl out 
proposed roBearcli In tlui original 
Ralow, we roviow tha work propound, 
durtnp; the oecond year. 



2m I General Sjya torn work 



We have implemented a complex system of instructional programs, 
EXCHECK, which presents the courses in introductory logic and sat 
theory, and facilitates the development of curriculum for these coursest 
A primary goal of the design of EXCHECK ia olmplicity of use without 
sacrifice in the sophistication of the material to be taught or the 
modes used to present it. EXCHECK has been implemented ao that lesson 
authors need not have any general knowledge of programming languages or 
techniques in order to write complex curriculum materials, and students 
are not presumed to have knowledge beyond that appropriate to the 
subject being taught. This implementation has been done in a modular 
way to facilitate program maintenance, modification and expansion (see 
below. Section 2. 1 • 1) . An important part of the modularization is 
the use of symbolic referencing of messages (see Section 2.1.2). We 
discuss some low-level student aids in Section 2.1.3, and the general 
input mechanism in Section 2.1.4. 

2. 1> 1 Description of the EXCHECK system 

The EXCHECK system has a modular design in which various functions 
have been segregated into separate programs, as seen below. Figure 1. 
Using features inherent in the TENEX operating system at IMSSS, these 
programs run as a single large program with interacting components. All 
curriculum lessons are written in VOCAL (Voice Oriented Curriculum 
Author Language, Hinckley et al (1976)). The VOCAL compiler produces 
interpretable lesson code from VOCAL lessons. The EXCHECK driver 
interprets these lessons and governs each student's progress through the 
course. It may invoke a proof checker to present the derivation of 
proofs or the construction of explicit mathematical examples , described 
elsewhere in this report. The proof checker has access to a theorem 
prover and to REDUCE, a program for algebraic reductions Hearn (1973). 
The audio program is described in detail in Levine and Sanders (1978) 
and below. Section 3. Each course has its own parser for handling 
complex input specific to the course (e.g., formulas ) which comes from 
the student. 

The ability of EXCHECK to run as a dynamic configuration of 
programs depends upon the multiprocessing facility of TENEX to 
coordinate many core images (**forks") as one program, without the need 
to program explicit overlays. When a student logs in the EXCHECK course 
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FlRuru !• Diagram oE the KXCHECK ayaCom. 

driver ia started and it decides which Eorko to aaaemble Por that 
student. Each fork can be written in a different pro^ramminR InnfimnRe, 
allowing programs written at other sites, such as REDUCfS 
Marti et al (1977), to be readily incorporated for EXCUECK courses. 



2.1.2 Symbolic referencing of messages and interf ork communication 

A wide variety of messages are transmitted to the student during a 
typical session. These include error messages, prompts for answers, 
information about possible answers and hints for doing proofs . These 
messages may be either displayed on the terminal or generated with audio 
through headphones. As these messages have begun to occupy a large 
amount of space in the programs , a facility was added to EXCHECK for 
referencing these messages symbolically from a file, as opposed to 
keeping the message itself in the core image. Beyond saving a 
significant amount of space, this feature simplifies the maintenance of 
messages and encourages the writing of more descriptive messages. 

A great deal of information is passed between the various forks - 
during an EXCHECK lesson. For example, the course driver and ^proof 
checker (which are separate forks) often need to access routines 
contained in the other fork. The simple fork structure makes it awkward 
for the proof checker to call routines in the driver fork, and also 
requires the driver to call the proof checking fork with the printing 
form of the LISP-style s-expressions which the proof checker must then 
read and interpret even though these s-expressions have almost always 
already parsed in the driver fork. We are altering the forks to share a 
large data area for lessons. This will allow the calling fork to pass 
only s-expression pointers through a fork call to provide the essential 
context for any routine in the other fork. By using this large data 
area we can also make the forks "co-routirie" so that the proof checker 
can access driver routines. Using this data area, the VOCAL compiler 
can save pre-parsed s-expressions on disk files so that the driver need 
not parse them. The savings in execution time from not re-parsing s- 
expressions is substantial. 
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Th^ KXCllRGK coura^ driver which tnt:0rprt3(:t^ let^s^ona prapfirad by the 
VOOAli pnmplbu' h49 three h^^t^lc modest for aurrtoulufn pri^t^dutcit: ion ; 
(.HSaON, IMWWSE And HRT.P. The I.RSaON^ pr^p^rBd for t^ha m\ln currlcuhim 
dtrecc i:hi^ ^^ner^l patcern of ^tudanta' proRr0ii3 rhrouRh coMrt^fJti, Tun 
nUOWHR facUUy Intended to t^lniul^te n t^tuilent'^ ahllity to skim 
forward or h^iokWfird while ref^dlnfi a tenthoolc or uote^t The HtudentQ 
ixlm review find umnipulate nuiterUl they hfive nre^ted, snoh fia stored 
ver^lon^ of. their proofs • apePlAl lesaonti are ^ivnllnbie to URLP 
atudente on polnte not oovered by the psenerni onrr IrulMn, fe<m?h as nee of 
the Gompnter ayKiteni, hours for tefuihlnp; aa^Utantfl, and datint'lpt Une ot 
th« Inference rulea available for iloln^ n proofi A nuna oomplet*^ 
doHcrlption of the RXGUKGK Qonrae driver and atudont nlda in avnllnble 
In McDonald at al (I97fl)i 



2 , K A VINPIIT 

All cho Input in KXCIIRCK iti obtained uhIuw one p^oneral rnnt luri 
cniled VTNPIJT. The major ndvancnge of utUnR VINPUT in tWti it: 
HyflcemncizoB acudunr. incernccionH . UslnR ^ nlURlo rouclno « Imp 1 * i t<»--) 
cho documencacion ut Eoacurea, and minlmizoa cho knowlodgo ocudent.^i m\t\t 
have of input routines. VINPUT nlao has CoacuroH that riuUice the amount 
of typing required and provide aaalatanco in dutormlninR the options 
available. One auch feature is input roco^^nition • When a uaor 
terminates an input string, if that string is conaiatent with at least 
one legal command, then the program will automatically add charactera 
until an ambiguity arises or some option is completed* The atudont can 
then continue typing to disambiguate the command, if necessary. 

At any prompt, the student can type a and then get a brief 

description of any command that was legal at that prompt. The mode 
entered by typing is Indicated to the student by prompting with 

Typing a question mark after some input will show all the 

alternatives which are consistent with the input so far. The 
feature interacts with recognition, so that if recognition halts, 
can be used to examine the alternatives before :ontinuing« 

VINPUT also offers a means through which the student can 
recursively access HELP, BROWSE and other special features at any point 
in the course. In the example shown below. Figure 2, the student 
entered BROWSE while in HELP, which was entered while doing a PROOF, 
entered while the student was in (an earlier) HELP which was entered 
from the main lesson. The student obtained a list of the legal commands 
by typing followed by more information about the default command, 

and, finally selected the default ("ok" took the student out of the mode 
entered with a 

In complicated situations, such as In Figure 2, it is crucial for 
the program to provide a student interface that is easy to understand 
and use. 
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*TL You are in BROWSE inside HELP Inside a PROOF inside HELP 

inside your MAIN LESSON. 
Where to? ( HELPl ): 2 

You are in BROWSE inside HELP inside a PROOF inside HELP 
inside your MAIN LESSON. 

?? stay here, browse, helpl, proof, help2, main lesson, logout 
[Student typed the ESC key to see a list of the options.] 
?? HELP1$ will return you to the help lesson immediately above this 
browse level 

[Student asked for information about one of those options.] 

?? ok£ 

Where to? ( HELPl ): ^ 
Returning to HELP above BROWSE... 

Figure 2. A sample student dialog. 



2» 1*5 Data Collection 

Procedures are being implemented and extended to collect and 
summarize considerable amounts of objective data on student proofs, 
including data on response latencies, and connect-times globally and 
locally for individual proofs. We will also collect and summarize data 
such as number and kind of student errors and inputs during proofs. 
These data will provide a basis for comparing versions of the theorem 
prover, and doing regression analyses on the relationship between errors 
and other objective variables. They will be of general use in the 
evaluation of the usefulness and naturalness of features of the EXCHECK 
system. 



2» 2 Proof Procedures and the Structuring of Proofs 



Research in the areq of proof procedures concentrates on developing 
the ability of the proof checker to interact with the student in the 
informal style common to standard mathematical practice. Fundamental 
progress in this area will also have direct applications to 
mathemacically oriented courses in such disciplines as physics and 
chemistry. 

We quote from the proposed research section (pages 41-42) of the 
original proposal: 



The first main component of the research in 
this area will be to redesign the EXCHECK proof 
checker to function not only as an interactive proof 
checker but also as an interactive theorem prover 



capable of accepting and executing complex proof 

procedures or strategies. The use of such 

procedures will mean that substantially more of the 

routine work of presenting proofs will be shifted 

from the student to the program, A good example of 

this from standard mathematical practice is saying 

beforehand that a proof being presented will be by 

induction, by cases, by the Axiom of Extensionality, 

etc . Such proof procedures determine a global 

structure on the proof being presented. Attention is 

then selectively focused on the various subparts of 

the proof, and in this way the global structure of ^ 

the proof is sketched out with the details added as 

needed. 

During the second year the EXCHECK proof checker was expanded to 
function as an interactive theorem prover following the design worked 
out during the first year. This work is not yet complete but all the 
basic procedures needed for EXCHECK to function as an interactive 
theorem prover capable of accepting and executing proof strategies, 
particularly reductive proof procedures, have been implemented. 

Reductive proof procedures reduce the current problem or goal to a 
set of subproblems or sub goals . For example, corresponding to the 
procedure for introducing a conditional into a proof, there is a 
reduction procedure for conditionals. When applied to a problem in 
which the goal is a conditional, the conditional reduction procedure 
will generate a new subproblem in which the antecedent of the 
conditional is a new assumption and the consequent of the conditional is 
the new subgoal. Schematically we write this as in Figure 3. In the 
reduction procedures shown here, the expression to the left of an ^|=»>' 
denotes the allowed assumptions and the expression to the right denotes 
the goal# In this figure, the upper line reads: prove A— ^C from S« 
The whole schema reads: reduce the problem ^prove A— C from 35' to the 
subproblem ^prove C fromJ5*,A'. 

25 A-^C 

|-> C 

Figure 3. Sample reduction. 

A major benefit of the use of interactive reductive proof 
procedures is the imposition of a global structure on students' proofs. 
The proof checking program uses the global structure to provide 
bookkeeping aid to the student, keeping track of the current context. 
The finer notion of context available to the program also greatly 
enhances the guidance capabilities of the program. Basic guidance is 
given by saying what the heuristic natural deduct ion theorem prover 
itself would do on the next step . This is usually presented to the 
student as a default option ; i.e., the student can simply accept the 



program's suggestion about what to do next by typing the ESC key. The 
example below illustrates some these benefits as well as highly 
interactive nature oE the system. 

2.2. 1 Usin.^ the Heuristic Natural Deduction Theorem Prover 

This section presents a proof in an elementary set theoretic system 
using the interactive heuristic natural deduction theorem prover. When 
a student is doing a proof, the initial goal is the theorem to be 
proven. As reductive proof procedures are applied, the proof is broken 
into subparts. The subparts constitute a hierarchy of contexts, in 
which previous results and assumptions from superior contexts are 
available in tha current subpart. After each application of a reductive 
proof procedure the display is updated to show the new context. 

Student input is underlined in all the examples in this report. 
Our comments (which the student does not see) are enclosed in square 
brackets and indented. Everything else in these examples is generated 
by the program while communicating with the student. We separate 
contexts into separate frames by horizontal lines. For this example we 
assume that the student has chosen a printing format in which the path 
of reductions to the current goal is always shown. 



Derive: U pB c p(AuB)! 
^redu ce 

Which proof procedure? (definition) *dheorem 
Number or Name (2.8.2) *^2.8.2 

(The student signals the program that he wishes to reduce the 
current goal. The program asks which proof procedure the 
student wishes to use, suggesting that he use a definition 
(here, the definition of subset) to eliminate the major 
defined symbol of the current goal. Hox^ever, the student 
wishes to do the proof in a different manner, using a 
theorem instead of the default. The program looks through 
the available theorems to determine which might work. Based 
upon a pattern matching routine the program suggests theorem 
2.8.2. (which states that if each of A,B is a subset of C 
then A B is a subset of C) . The student chooses this 
default, and the program applies the reduction. The result 
is shown in the next display.] 
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Derive: f>k l/^B ^^(AUB) 

Show: (G2) ^A ^^(Al/B) and ^B S^^Av/B) 

* reduce 

Which proof procedure? (conjunction) *_$con junction 

[The current goal is now a conjunction. The student chooses 
the default to break it into two subproblems and prove each 
conjunct separately.] 



De rive : p A I' ^ B p(Alfe ) 

Show: (G2) pA fitp(AUB) and^B^^CA B) 

Show: (G3) ^A £ pCAi/B) 

*redu ce 

IThich proof procedure? (definition) *^definition 
Number or Name (subset) *^subset 

[In the previous frame we saw that the original goal was 
reduced to a conjunction. The goal is now the first 
conjunct. The student chooses to reduce the current goal by 
applying the definition of subset.] 



Derive: ^A V f& ^^A(/B) 

Show: (G2) ^ ^^(Al/B) and 0^(Al/B) 

Show: (G3) f>k ^ fJik\B) 

Show: (G5) (/c)(C^ pA->C ^ ^(Al/B)) 

*redu ce 

VJhich proof procedure? (universal) ^eliminate 

Mhxch svzibols? (6) *_$e 

What reducer? (theorem) *_$ theorem 

Number or Name (powerset) *^owerset 

Occurrences (all) *J.all 

[The student directs the program to eliminate the two 
occurrences of ^^S' using the theorem of powerset •] 



16 

9 



Derive: /»A U f& ^^ktjB) 

Show: (G2) and ^-j)(Al/B) ■ 

Show: (G3) ^^AVB) j 

Show: (G5) (/ C)(C^ ^A-^C ^ ^AV/B)) 

Show: (G6) (\rc)(C A-^C ^ AUB) 

*boole (1) *2£ (V^ C)(C A-;^ C A(;3) 

1 replace using th. powerset 

(2) (/C)(C 6^A -^C 6 f(AUB)) 

2 Df. subset 

(3) ^A S^(AUB) 

[The current goal is an obvious truth of boolean algebra. 
With the notion of the current goal the student can get the 
formula corresponding to the current goal by typing ^^G ' • 
The ^ ^G ' feature greatly reduces the amount of typing and 
hence the number of typing mistakes ^ When the BOOLE 
procedure accepts the current goal, the procedures of the 
theorem prover work back through the previous goals. 
Additional lines are generated to satisfy previously 
unsatisfied goals as their subgoals become satisfied. Line 
(2) was automatically generated by the program because its 
only subgoal had just been satisfied by line (1). Similarly 
line (3) was automatically generated when (2) was used to 
satisfy its only subgoal.] 



Derive: ^A l/^B ^^(AUB) 

Show: (02) ^A ^^(Al/B) and ^B ^^(AOB) ; 

Show: (G4) ^B ^(AL®) 

*red uce 

Which proof procedure? (definition) *_$def initioa 
Number or Name (subset) *$^subset 



[The current goal is now the second conjunct from the 
original goal. The student is about to do this proof in the 
same manner as for the first conjunct. The next two frames 
show the similarity of the second sub-proof to the first.] 
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Derive; p A I'^B ^ ~ 

Show; (G2) pA 5ip(AUB) and^B^^(A B) 

Show; (G4) ^B ^^(aUb) 

Show: (G7) (\/C)(C 6^B-^C Ap(AUB)) 

*redu ce 

Which proof procedure? (universal) ^eliminate 

Which symbols? (B) 

What reducer? (theorem) *$^theorem 

Number or Name (powerset) *Jpowerset 

Occurrences (all) *$all 



Derive: ^ kO pB 

Show; (02) pA ^ip(AUB) and^BS^(A B) 

Show; (04) ^B ^^(AJb) 

Show: (07) (\/C)(C ^?f>B-^C Ap(AOB)) 

Show; (08) ( t/C) (C S-B ->C s:A\/B) 

*boole (4) (V'C)(C ^ B -^C SAUb) 

4 replace using th, powerset 

(5) (\/C)(C ^^B-?C 6 rfAl/B)) 

5 Df. subset ^ 

(6) pB :£p(Al/B) 

3,6fc (7) pA ^p(kUE) and pB ^ p(AUB) 
7 Th. 2.8.2 ^ ^ 

(8) l/pB gp(At;B) 



Derive: U & ^At/B) 

boole (1) (V C)(C S A-^C S^AUB) 

1 replace using th. powerset 

(2) (VC)(C^^A-^C a ^(MJB)) 

2 Df. subset 

(3) ^A jS.p(AvJB) 

boole (4) (\/C)(CjS B -^C S Ai/B) 

4 replace using th. powerset 

(5) (\/C)(C 6^B-> C <£: ^(AUB)) 

5 Df • subset 

(6) pB ^^(AOB) 

3, etc (7) pA3p(Atm) and /oBS p(Ai;B) 
7 Th. 2.8.2 

(8) ^A UpB ^^Al/B) 

Use QED to finish your proof. 



[All the subgoals have been satisfied and the proof is now 

conceptually complete. The program produces the completed 

proof and waits for the student to type QED. This allows 
the student to review the entire proof,] 



2,2.2 Proposed research on the heuristic natural jdv\duction theorem p rover 

In the coming year, we will refine and extend the heuristic natural 
deduction theorem prover. Obvious improvements can be made by extending 
the prover to accept complex proof procedures composed of simpler ones. 
One such procedure suggested by the example above is one that allows 
students to specify that the proof of a subgoal is ^similar' (in the 
sense of using the same proof procedures) to the proof of a previous 
subgoal. Mechanisms will be added to facilitate proof procedures which 
operate by letting the prover run in an automatic mode, stopping only at 
preselected points — for example, those in which no default is 
available, or those which involve a particular reduction. 

We vlll also explore the possibility of substantially more 
sophisticated procedures. One example is a procedure which can choose 
the terms with which to prove existential formulas. Picking terms with 
desired characteristics is often the crux of a proof. Another example 
is a procedure that will make intelligent decisions about when to do 
proofs by cases, and what those cases should be. Another sophisticated 
procedure would allow one to do proofs in a manner similar to previously 
recorded proofs. This is a particularly important procedure, since 
there certainly will be proof procedures which can be demonstrated by 
clear examples, yet which lie beyond or outside the scope of our 
heuristic prover. 
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Evaluation of the heuristic natural deduction Lheorem p rover 



Evaluation of the interactive theorem prover in the third year of 
the grant will focus on student use. Where applicable, standard 
statistical tests (such as chi-square) will be used to analyze the data 
collected and evaluate hypotheses on student use of the theorem prover; 
If warranted by the data, more sophisticated (e.g., Markov) models of 
student behavior will be tested. 

One important aspect of student use of the interactive prover is 
how often they use its more powerful features. For individual students, 
we want to know if use of powerful features increases, decreases or 
remains constant as they progress through the course. We will also 
examine the kind of use that is made of the prover: is it relatively 
uniform or dominated by individual differences with respect to amount of 
use or particular instances of use. Use that can be related to 
structural features of the theorems or proofs will be extremely valuable 
in evaluating the procedures used, and suggesting improvements. 

Another broad area of concern is whether or not students are 
learning and using higher-level strategies and algorithms in completing 
their proofs. We will study this in part by including a facility for 
students to type sequences of commands as a single command- The use 
made of such a facility, and especially increases in such use, will be 
an indication of whether students are using and learning complex proof 
strategies . 



2.3 Student aids 

Research in this area is concentrated on developing procedures for 
providing guidance to student* attempting to solve problems. Again we 
quote from the proposed research section of the original proposal (page 
40): 

Essentially the same proof procedures and 
strategies used in the interactive theorem prover 
can be used by students to express plans for their 
proofs and also to experiment with finding proofs by 
successive reductions. Such thinking out of 
possible approaches to the proof is, of course, in 
keeping with standard practice and providing 
facilities to aid this process would considerably 
upgrade the instructional capability of the system. 
Guidance will be provided the student by examining 
the collection of proof procedures and strategies to 
find those applicable to his current partial proof. 

In the logic course guidance will be provided 
students by using a theorem prover to find 
continuations of their partially completed proofs. 
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In later stages of the research we will analyze 
methods for informally describinj^ the continuations 
to determine those most effective pedagogically . 

The interactive theorem prover as described in Section 2.2 nakes it 
possible to provide several forms of guidance to the student. By 
comparing a student's current goals and assumptions with the consequents 
and antecedents of various theorems, reasonable alternatives for 
continuing the proof can be described in a fairly complete tnanner. The 
prover then provides guidance by suggesting different default reductions 
based upon these alternatives. 

Other forms of guidance will be derived from the relationship 
between the general proof strategies of the theorem prover and 
heuristics which generate departures from those strategies. Of 
particular interest will be higher-level algorithms involving complex 
sequences of inference procedures (especially reductions) . It is in 
such structural features that further guidance for the student can most 
profitably be rooted. 

It should be noted that the interactive theorem prover is in itself 
a very effective (albeit incomplete) guidance mechanism. That is, the 
theorem prover with default reductions is generally effective in 
generating proofs, and presenting these defaults constitutes guidance to 
the student. Comparisons between proofs completed with and without 
additional guidance beyond the default reductions will therefore be of 
great interest. We will compare these student proofs on the basis of 
their similarity to traditional proofs, and on the basis of objective 
data on student proofs (see Section 2.1). 



2*4 More Powerful Inference Procedures 

Research in this area is concentrated on developing "natural' 
inference procedures. We consider inference procedures to be natural 
when they justify the inferential steps actually made in standard 
mathematical practice, and, they are easy for students to understand and 
use. See pages 40 and 51-55 of the original proposal (Suppes (1977)) 
for fuller discussion of natural inference procedures. Some simple 
examples of natural inference procedures are the HYPOTHESIS and RAA 
(reductio ad absurdum) procedures familiar from standard mathematical 
practice and common logical systems. A more complex natural inference 
procedure is the IMPLIES procedure. It is used to derive results that 
follow by applying a previous result or definition. I14PLIES is 
described on pages 19-20 and 53-54 of the original proposal. 

During the second year we added a new high-level natural inference 
procedure ESTABLISH to the EXCHECIC system. It replaces the old VERIFY 
procedure which was described in the original proposal on pages 16-19 
and 51-53. The fundamental difference between ESTABLISH and VERIFY is 
that ESTABLISH can be used to derive results that are consequences of 
prior results iii the theory under consideration while VERIFY only can be 
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used CO derive results that are logical consequences of prior results. 
In particular, in the set theory course, ESTABLISH can be used to decide 
simple set theoretic consequence while VERIFY is restricted to logical 
consequence. The examples in Figure 4 and Figure 5 illustrate the 
distinction between the two procedures. 



(i) B £ A 
( j ) A S B 

*i,iv erify *B A 

Using *a^xiom (Number or Name) *e^xtentionality 
Using *def inition (Number or Name) ^subs et 
Using *ok 

i,j Verify Using: Ax. Extensionality , Df. Subset 
(4) B = A 

Figure 4. A simple inference using VERIFY. 



(i) B £ A 
(j) ASB 

*i,1esta blish *B = A 
i, j Establish 

(4) B = A 

Figure 5. The same inference using ESTABLISH. 

Because "^A = B' is a set-theoretic consequence but not a logical 
consequence of ''A B & B A' students using procedures that are 
restricted to logical consequence (such as VERIFY) must cite sufficient 
support to reduce the inference to a purely logical inference. In the 
example in Figure 4 the student must cite the axiom of extensionality 
and the definition of subset in order to use VERIFY. In Figure ,5 the 
student need not cite any prior results or definitions when using 
ESTABLISH. Thus, ESTABLISH is considerably easier to use than VERIFY 
because students can directly express well understood set theoretic 
inferences without first analyzing them into purely logical inferences. 
Such analysis not only disrupts the student's concentration, but also is 
difficult to do. Even the most experienced logicians and mathematicians 
(much less students) have difficulty ferreting out all the axioms, 
definitions, and theorems needed to reduce inferences within a theory to 
purely logical inferences. 

The domain of ESTABLISH is theorems in elementary set theory but we 
do not expect it to prove all the theorems in the set theory course 
since it was designed to handle only the simple set theoretic inferences 
and theorems that occur in the course of larger proofs. It is difficult 
to give a good account of the scope of ESTABLISH. The most rigorous 
approach is to provide a general characterization of simple set 
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theoretic theorems and prove a completeness theorem statinjc; that a 
formula is provable by ESTABLISH just in case it is a simple set 
theoretic theorem according to the characterization. There are two 
problems with this approach. The first is that there is no such 
characterization known to us (or, at the time of this writing, to anyone 
else), and, the second is that the procedures used by ESTABLISH are 
rather complex and any proof based upon induction over those procedures 
would also be rather complex* 

We informally characterize the range of ESTABLISH as those theorems 
which are regarded as ""simple' by any standard text, e.g., 
Suppes (1960). More precisely, our set theory course (based upon 
Suppes' book) contains approximately 600 theorems. If by ""simple' we 
mean the earlier theorems (the first third) then the result is that 
ESTABLISH will prove about 85% of those first 200 theorems. It should 
be noted that the equivalence between simple and early is rather rough. 
Some of the early theorems (e.g., the Schroeder-Bernstein theorem) are 
not simple . We give below (Figure 6) some examples of theorems that 
ESTABLISH can prove. ESTABLISH does more than simply prove theorems of 
boolean algebra. Note that none of the theorems below is a theorem of 
boolean algebra. In the following: ^ ' is used for power set, ^ ' is 
used for cartesian product , and "" ' is used for both union and 
generalized union. 

1) <x,y> = <u,v> iff X =» u and y = v 

2) If t2very element of B is a subset of C then l^B C 

3) A ^ B iff ^A ^ ^B 

4) A><iB = B?<'A iff A is empty or B is empty or A = B 
Figure 6. Some examples of theorems that ESTABLISH can prove. 



The actual mechanisms used in ESTABLISH are too complex to fully 
describe here but the following explanation should give a good sense of 
the general process. ESTABLISH is based upon natural deduction 
heuristics and reduction procedures that simplify an inference and then 
use resolution or decision procedures as appropriate to attempt to 
decide the simplified formulas. During this reduction or simplification 
the needed axioms,- definitions, or theorems are selected by the 
reduction procedures. ESTABLISH uses all the standard kinds of 
reductions, e.g., expanding defined notation. 

The main step in ESTABLISH is the reduction or simplification 
process. Unless this is done with considerable care the result is 
likely to be an enormous formula in primitive notation. ESTABLISH 
controls the size of the resulting formulas by doing reductions in a 
particular order, by using proof strategies (complex sequences of 
reductions) , by using special auxiliary algorithms to determine set 
theoretic truth or falsity where possible, by using special algorithms 
like TEQ and BOOLE v;here appropriate, and by using information from the 
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student about what methods should be used. The order of reductions was 
partially determined by experimentation on the set theory list of 
theorems. 

During the third year of the grant, we will continue to study the 
relationship between traditional (textbook and classroom) proofs, and 
EXCHECK proofs. We expect to continue developing and improving the 
inference procedures of EXCHECK on the basis of those studies. We will 
be particularly concerned with identifying the structural similarities 
and differences between traditional proofs and those produced in 
EXCHECK, both by proctors and students. The structural information will 
be used to develop further modification and extension of powerful 
inference procedures. 



2,5 Elementary exercises and dialogs 



The work described is an extension of the work on ELFS described on 
pages 3-4 of the first annual report. It falls under both the Student 
Aids and More Powerful Inference Procedures sections of the original 
proposal, 

A great many exercises in elementary mathematics take the form of 
the student having to specify finite objects satisfying given 
conditions. An important use of such exercises is in providing 
counterexamples to arguments (or statements in a theory). 

For a large class of these exercises there are programs that will 
generate a tree of formulas from the original statement of the problem. 
We call such trees "verification trees". Essentially, the verification 
tree for a problem constitutes a reduction of the original (usually not 
directly verifiable) condition to a collection of directly verifiable 
conditions (the formulas at the leaves). These trees have the property 
that the failure of the formula at a node in the tree explains the 
failure of formulas at any of its ancestors. 

For example, given the finite sets A and B the student might be 
asked to give an injection F from A into B, i,e,, a function F that is 
one-to-one. If the student forgets to specify a value for an element of 
A, the first response to the student is that the domain of F isn't A. 
The student can then ask: " Why?". The program answers (going towards 
the leaves) that it is because A is not a subset of the domain of F, If 
the student again asks "Why" the program will give a specific element in 
A which is not in B, The program can give a specific counterexample 
since the routines that evaluate the formulas at the leaves provide 
counterexamples when those formulas fail. 

The student can ask other questions besides "why" after an error. 
At the first response by the program ("The domain of F isn't A") the 
student can say: "So?" . The program then moves one step towards the 
root (the original statement of the conditions) and says; "Then F is not 
a map from A into B." The student can again say: "So?", to which the 
program responds: "F is not an injection from A to B." 
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Below we present Che typescript of an actual interaction between a 
student and the exercise system. The problem is the one we have just 
discussed: to give an injection F fron A into B. 

A = {1, 2, 3} B = {4, 5, 6, 7} 

Give an injection F from A into B. 

Specify: the function F 

Let: F = {<U4>, <2,5>> 

F - {<1,4>,<2,5>> 

Let: ok 

[Here the program is called to evaluate the student's 
response. Its initial response is:] 

Wrong, since it is not true that Dom(F) = A 

[The student has forgotten to include 3 — an element of the 
domain A — in specifying the function F. ] 

[The student now asks:] 

: why 

« 

[The program responds:] 

Because it is not true that A C dom(F) 

[The dialogue continues until the student understands what 
his error was and why it led to an incorrect solution to the 
original problem.] 

: j^y 

Because it is not true that (V x) [x ^ A x a dom(F)] 
: t^y 

Because 3 6. A and not (3 y) <3,y> £ F 

[Having seen the reasons why his response was wrong the 
student now investigates the set-theoretic consequences of 
his error.] 



: s^o 

it is not true that (l^ x) [x 6 A x e, dom(F)] 
: so 



it is not true that k<^ dom(F) 
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so 



it is not true that Dora(F) = A 
: s^o 

it is not true that F: A-^B 
: s^o 

it is not true that F: A inj B 



The original problem was to show that F: A inj B 



As the example shows, the deductive chain of reasons and 
consequences generated by the "verification tree" gives the exercise 
system the ability to state reasons for its assertions when it is asked 
"why", and, to state the '^et-theoretic consequences of its assertions 
when "so" is demanded. The highly structured information in the 
verification tree prov:* the semantic base for a dialog with the 
student in which the pr can explain to the student what is wrong 

with the answer. It s )u " be noted that more complex forms of 
explanation are available. In parcicular the program could have said at 
the beginning that: "Because 3 is not given a value by F, the donaln of 
F is not A and hence F is not a function from A to B." 

In the coming year, we will implement a number of exercises similar 
to the example shown here, and will explore ways to refine and extend 
their use. The exercises implemented will range from such simple 
problems as specifying different types of functions and relations to 
complex problems such as those dealing with boolean algebras. Many of 
these exercises will be incorporated as help lessons, because they 
provide for a highly focused analysis of individual concepts. These 
exercises will also be refined to provide an optional analysis of 
correct answers, and to generate correct examples as hints. We will 
also experiment with algorithms to find the best explanation for a 
student's error by choosing the appropriate node in the verification 
tree from ;i3hlch to begin the dialog. 



so 
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Audio and Prosodic Features 



The work on Audio and Prosodic Features is proceeding largely as 
outlined on page 83 of the original project proposal (Suppes (1977)). 
We quote from that page: 

... the syntactic parser and contour-generation 
producers would be reimplemented for efficient on- 
line generation of synthesized speech. 

In the second year, we would have one 
operational remote MINI-MISS machine . . . , continue 
to implement new LPC techniques, and simulate the 
concatenation of affixes to root words. We would 
continue to study the relation of the quality of the 
synthesized speech and the strain on the listener in 
understanding the speech. We would add the semantic 
analysis and story-view analysis to the total 
prosodic analysis and continue improvements to the 
lexicon. 

The implementation and testing of the MINI-MISS machine has been 
postponed in favor of additional testing of data compression techniques 
in order to insure that the machine produces the best quality speech, 
utilizing the most sophisticated and up-to-date techniques and 
components possible. A MINI-MISS machine could be built now, but it 
could not be built to utilize the expected data reduction improvements 
without resorting to an overly complex design configuration to 
accommodate the different possible reduction methods. We discuss this 
issue in more depth below. Section 3.1. 

In the first year of the grant we initiated work on the 
concatenation of affixes to root words, which had immediate application 
' to the maintenance of curriculum. This necessitated delaying work on 
the on-line generation of prosodic features until the current year. A 
fuller discussion of the schedule change is presented in the first 
annual report, Blaine et al (1978). A procedure providing on-line 
generation capability has been Implemented. We describe this procedure 
below in Section 3.2.3. Evaluation of this procedure will proceed in 
the next year of the grant. v/e implemented a simple model of the 
prosodies of a "story-view" of the structure of text and will continue 
to develop this model for implementation into the prosodic generation 
for the curriculums. We incorporated the analysis of some semantic 
features into the syntactic analysis and into the parameter generation 
routines instead of constructing a separate - semantic component. These 
topics are discussed below in Section 3.2.1 and Section 3.2.2. 

We evaluated different aspects of the word concatenation aIp,orithm 
in a rating experiment which is described below, Section 3.3. 
Evaluation of the effectiveness of audio messages in the cour?5e in 
Elementary Logic at Stanford Is also proceeding. 



20 27 



Mlni-Mlss and data rate studies 



\'le proposed to build and test a Mini-MISS machine during this year 
of the grant but we have decided to defer the actual construction of the 
machine pending further research on data compression and interpolation 
of sound data. Construction of the Mini-MISS machine is within our 
capability at any time we chose to implement the current best techniques 
in compression and interpolation* However, we foresee substantial 
improvements to these techniques in the near future whose incorporation 
will make Mini-MISS a considerably more useful machine. We have chosen 
instead to concentrate during this grant year on further research and 
experimentation to increase the degrv^a of compression and the usefulness 
of interpolation for such a machine. We are attempting, during this 
grant, to reduce the transmission and storage bit rate from our current 
9,000 bits per second to a rate of 3,000 to 4,000 bits per second. This 
attempt will be brought to a close in the third grant year with the best 
compression we can achieve . At that time , we will proceed to 
construction and testing of a Mini-MISS machine employing this 
reduction . While other LP systems (e.g. , Texas Instruments "Speak and 
Spell", see below Section 3.1.4) have lower transmission rates than 
we are envisioning, the usefulness of Mini-MISS is predicated on 
maintaining higher quality speech (necessitating higher data rates) than 
these other systems are concerned with achieving. 

In addition to the primary advantage of rate reduction for the 
Mini-MISS there are also two secondary considerations which impel us to 
further research on compression and interpolation, rather than 
proceeding to construction of the machine • A successful interpolation 
algorithm will also enable us to perform better word formation through 
affix concatenation and also to smooth inter-word junctures to sound 
more continuous. The second consideration in deferring the building of 
the Mini-MISS is that micro-chips for parts of the speech synthesis 
algorithm may become commercially available in the near future and these 
chips will simplify the design of the Mini-MISS machine. 

3. 1. 1 Design considerations for the Mini-MISS machine - 

We list here our basic design considerations for the Mini-MISS 
machine. Following the list is a discussion of these criteria in more 
depth. Criteria 1, 2 and 5 were integral to the design of the original 
MISS machine, the third (data rate) was deliberately not in the original 
design and the fourth (interpolation) will be implemented in the MISS 
machine as a way of perfecting the design for the Mini-MISS. 

1) It must be capable of synthesizing a Linear 
Predictive (LP) encoding of speech in real time. 

2) It must be capable of performing prosodic 
manipulations for sentence formation by word concatena- 
tion. 

3) It must have a sufficiently low data 
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transmission rate, so that the speech data CORecher with 
curriculum display commands can be sent to it over a 
(leased) phone line, i.e., at a combined data rate of 
approximately 4800 bits per second. 

4) It raust be able to perform interpolation of 
sone format of compressed LP coefficients : reflection, 
area, log-area ratio or formant. 

5) It should be micro-coded but should not contain 
more elaborate hardware facilities than are needed for 
satisfying the other design criteria. 

Linear Prediction has proven itself (see Sanders and Laddaga (1976) 
and Laddaga and Sanders (1977)) as a high quality, efficient technique 
for speech synthesis. The manipulation of fundamental frequency, 
duration and loudness when concatenating words to form phrases and 
sentences imparts needed naturalness to the pr^. «ess of generating audio 
messages for a curriculum. While recording entire phrases provides more 
natural sounding speech, the requirements for storage and recording of 
new phrases make this approach too costly and time consuming. Also 
there is less flexibility in the design, production, maintenance and 
revision of curriculum materials using recorded phrases, due to the high 
turn-around time required for recording the phrases . These two 
criteria, the use of LP and the prosodic capability, are critical to the 
usefulness of the Mini-MISS machine. 

The importance of a low transmission rate is that it allows a 
remote student terminal to access an audio lesson without requiring a 
second phone line since both audio and display commands can be sent over 
the same phone line. If a higher bit rate for the audio messages was 
used, a second phone line would be needed for remote access to transmit 
the audio messages. In addition to the transmission rate, a low bit 
rate permits less expensive storage of speech data at remote sites, thus 
providing the potential for more extensive use of the audio capability. 
In order to provide the lower transmission rate, the Mini-MISS must be 
capable of interpolating LP parameters, with little degradation of 
quality. Interpolation is necessary because only a small decrease in 
the current bit rate could be accomplished by using a more elaborate 
coding of the parameters to optimize the information content of the 
coding on a f rarae-by-f rame basis . We discuss interpolation below. 
Section 3.1.2. 

Our final criterion is that the Mini-MISS be micro-coded but not 
"over-designed. " We could utilize expensive and sophisticated micro- 
processors to build a machine that would have more capability than any 
single design for interpolation of LP coefficients with prosodic 
capability could require. This would be a misplaced effort, since such 
a machine would be largely unused and excessively expensive. We intend 
to design appropriate components for a particular synthesis (de- 
compression/interpolation) algorithm. In particular, we are conducting 
an experiment, described below. Section 3.1.5, to determine the 
parameters and utility of a specific interpolation method. The use of 
micro-coding is still important since this decreases development time 
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and cost, and, provides flexibility in addinj; improvements within the 
chosen technique. 



3,1,2 Data compression and interpolation 

We are investigating the use of interpolation for data compression 
of LP coded speech sounds. Interpolation is a method, to reduce the 
redundancy of information contained in LP analyzed speech across long 
sequences of data points. It is thus an approximation to the original 
LP representation, but one that can preserve the perceptually important 
features of that representation. Section 3,1.5 describes our 
experiments investigating the relationship between the parameters of 
interpolation and perception of speech quality . The importance of 
determining the form of interpolation to the design of the Hini-MISS 
machine is discussed below. Section 3.1.3. 

There are several representations for LP coding of speech which are 
mathematically equivalent (See Makhoul (1975) for a general introduction 
to LP, and Levine and Sanders (1978) for a discussion of the different 
representations.). Interpolation of LP coded speech is dependent on the 
representation of the LP "transfer function" (Makhoul (1975)) which is 
shown in equation 1. 

G 

A(z) = (1) 

P [ -i ] 

1 - 2 [ a * 2 ] 
i_l [ i ] 

In equation 1, A(z) is the transfer function, G is the amplitude of the 
original signal, p is order of the polynomial (in our case usually 12), 
the a's are the constant coefficients of the function, and z is a formal 
parameter* Equation 2 gives the form of the transfer function in terms 

G 

A(z) - (2) 

P [ -1 ] 

prod [ 1 - R * z ] 
i_l [ i ] 

of it complex roots, R^, which occur in complex conjugate pairs. Each of 
the (complex) R^ , can be represented, as in equation 3, in polar 
coordinate form as a modulus, r, and an angle, or? 

R = r * e [j=sqrt(-l)] (3) 

i 

cr- 

F = — - (4) 
i 2 TT 

-1 

B = * log ( r ) (5) 

i 2:n: e 
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The "pseudo-formant" frequencies, F^, and bandwidchs, B^, for each 
complex conjugate pair is computed as in equations 4 and 5 where the 
frequency and bandwidths are numerically expressed as a fraction of the 
sampling frequency. Since the LP coding is itself an approximation to 
the original speech signal, the and are not necessarily the actual 
formants of the signal, hence the term *'pseudo-formants". We will 
return to our method of assigning the pseudo-formants obtained from the 
LP coding to particular formant slots. Since the order of the LP model 
is even, real roots also occur in pairs and provide gross spectral 
shaping. They will also be considered pseudo-formants here. 

The transformation to pseudo-forraants from other LP representations 
(such as reflection coefficients) is exact and invertable. The purpose 
of doing the conversion is that the pseudo-f ormant frequencies are 
smooth functions of time and thus likely candidates for interpolation 
techniques. The bandwidths due to the vocal tract are also quite smooth 
functions of time but LP analysis is sensitive to small positional 
variations • However, errors made in the estimate of the bandwidths are 
less important perceptually than errors in estimating the formant 
frequencies. This relative unimportance to perception makes it feasible 
to interpolate bandwidths even when they have large variances. 

There are several considerations which are consequences of this 
approach to LP interpolation. It is critical to have an accurate and 
robust polynomial root finder in order to deal with the particular 
polynomials that result from the LP model. Our original root finder was 
modified to yield reliable results by saturating arithmetic overflow and 
underflow. It is also critical to shrink the unit circle in the domain 
of the polynomial to provide a convergent solution for the initial 
affricate /dj/ as in "jack' (and similar sounds). A convergent solution 
of the polynomial would not otherwise be available. 

A further consideration for interpolation is numerical 
representation of real roots of the LP polynomial. While a pair of 
broad band complex poles c^n meet on the real axis with little effect on 
Che frequency response, our present representation of the real poles is 
discontinuous at the point of approach to the real axis. We are 
proceeding to work on representing these poles in a way that is suitable 
for interpolation. 

The other major consideration for using the formant and bandwidth 
representation of LP is that a polynomial root solver does not order the 
roots in any natural way* This fact discouraged previous researchers 
from considering formant frequencies and bandwidths as a practical 
method of encoding the LP model* We have developed a number of 
(automatic) heuristic constraints which order pole pairs so that 
formants are numbered in an appropriate manner. The heuristic can over- 
rule the simple frequency ordering to preserve the smoothness of the 
first 4 (and sometimes 5) pseudo-f omants. Any resulting broad band 
pole pairs are assigned to the highest available pseudo-formants after 
the real pole pair (if it exists) is assigned to the 6th pseudo-f ormant. 
Unvoiced speech regions are analyzed x^ith fewer poles (3 or 4) than 
voiced regions, so the extra (zero) poles are assigned in a heuristic 
manner . 
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This approach to ordering the formants yields a great deal of 
smoothness in Eormants 1 through 5. Thus, they are well suited to 
linear interpolation. Formant 6 concentrates much of the Erame-to-f rame 
variability in the LP coefficients, and so is poorly represented by 
linear interpolation- This situation is somewhat paradoxical because 
the broad band pole pair is the least perceptually significant and a 
simple representation j^hould be sufficient. Efforts are underway to 
simplify the representation of pseudo-f ormant 6. 

3.1,3 Interpolation and the design of Mini-MISS 

The design of the Mini-MISS machine will critically depend on the 
arithmetic requirements of the interpolation and data compression 
me thods we adopt . The current MISS machine has a humming machine 
(Levine and Sanders (1978)), which is essentially a general purpose 
microcomputer whose function is to manage the speech data being 
transmitted from the PDP-10 and to perform the prosodic calculations • 
In the Mini-MISS the humming machine would also have to do the 
interpolation calculations and whatever coefficient transformations are 
necessary for those calculations, followed by a final conversion to LP 
reflection coefficients for synthesis by the digital filter part of the 
machine. 

One example of how the design of the humming machine depends on the 
compression methods used is the question of how much special purpose 
processor power is required for interpolation. Different special 
purpose sub-components may be needed depending on which representation 
of the LP coefficients (e.g., formant and bandwidth) is used for 
interpolation since that choice will determine the arithmetic complexity 
of converting from this representation to reflection coefficients for 
digital filtering. For example, some conversions would require a fast 
divider while others would not. 

In addition to the important data rate reduction which a good 
interpolation method will provide, it will also be useful for ^affix 
concatenation and word junction smoothing. We are investigating word 
formation by concatenating affixes to word stems as part of this grant. 
Important work on affixing was done during the first year of the grant 
(see Blaine et al (1978)). A major improvement will result from the 
availability of interpolation to smooth the boundary of the stem and 
af fix when they are concatenated. One of the conclusions of the 
experiment rating relative unnaturalness of word concatenated utterances 
(reported below. Section 3.3.1) was that the use of individual words 
contributed the most to the overall unnaturalness of concatenat d 
speech. Here again, interpolation can help to overcome t.te 
unnaturalness of word junctures and .provide smoother sounding 
utterances . 
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There are currently two other apofich synthesis units wh Ich may 
become connercially available in the near future. Telesenaory Systeng 
Inc. (TSI) ia developing a pair of chips which will perform the digital 
filter portion of the Mini-MlSS. As mentioned above, the Mlnl-MISS will 
contain a filter section which is fed hy the humming machine. If the 
filter chips are available from TSI in time to be included in our final 
plan for the Mini-MISS, including it will simplify the design. Since 
the current MISS microcoded filter is reasonably adequate for Mini-MISS 
as well the availability of the TSI chips is not critical to the 
successful completion of the grant. 

Texas Instruments has recently begun marketing "Speak and Spell", 
which is a device using LP coded speech to speak words from a limited 
vocabulary and spell them out loud. This device is oriented towards 
drilling spelling at low cost. While the technological sophistication 
of the synthesis chip in "Speak and Spell" is considerable, it has 
internal hardware constraints which make it unsuitable for the quality 
of audio synthesis which is necessary for our CAI applications- 
Furthermore, the chip is designed in such a way that these constraints 
can not be overcome by incorporating the chip into a larger synthesis 
sys ten. 



3,1,5 Experimental design for compression and interpolation experiment 

We stated above (Section 3.1.1) that one design criterion for Mini- 
MISS was the ability to keep speech quality high at a data rate of 4,000 
bits per second (or lower). We are conducting an experiment^ to test the 
usefulness and parameters of linear interpolation of a "formant and 
bandwidth' representation of LP coefficients as a method for reducing 
the data transmission rate. An experiment using the judgements of human 
subjects is necessary to determine the parameters of interpolation and 
thereby to establish the usefulness of the method. Once the experiment 
is completed the design for the micro^coding of Mini-MISS can be 
completed and the hardware construction will quickly follow. 

An approximation of real data by a simple smooth contour, such as 
linear interpolation of LP coded speech, is crucially dependent on how 
the error between the smooth contour and the real data is measured. 
Current practice in LP interpolation is to define the error as a 
function of the spectral difference between the model and the real data- 
A problem for this measure is that it can easily lead to perceptually 
inconsistent results. There is much disagreement between spectral 
differences and perception since the the speech perception space is not 
necessarily convex nor is the difference measure necessarily Euclidean. 
Perceptual distance limens for formant frequency and formant bandwidth 
correlate poorly with their corresponding spectral differences. VTliile 
spectral difference has long been used by the engineering community in 
the design of audio amplifiers, the spectral differences in that domain 
are on the order of 1% or less. Much larf>er differences are necessary to 
achieve a low frame rate interpolation technique, and at the larger 
magnitudes the spectral and perceptual differences diverge. Without 
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lujliip, a por::iipl:ual inodol, thn spoctrnl cllffuronco mcatluul can not tako 
ailvnntaf'B of poH.sLhly ^^^reatar Lnturpolacions in particular c Lrcuma tannoM 
without rlakinc? an unaccep table number oE barl prerllctlons . 

The experiment we are performing Involvoa a almplo perceptual model 
of speech representation which employs paeudo-f ornanta and bandwldths 
(see above Section 3.1»2)^ Present perceptual models are bawled on very 
simple phenomena, guch as the perception of steady state vowels. The 
prediction of complex speech phenomena from such models is not at all 
straight-forward. Our studies during this grant have indicated that 
development of a good general perceptual model is still quite far off, 
given the current state of research. As an alternative to a conplex 
model, we are trying to develop partial decision criteria for 
constraining the utilization and interaction of the simple perceptual 
models and the g«}neral techniques such as spectral dis tance measures • 
In the course of our experiments, we will specify some of these decision 
criteria a^d exarlne the quality of resultant interpolated speech. 

We ha ''e devp loped a successful pseudo-f ornant acoustic model that 
separates smooth*' y varying LP parameters from the irregular parameters 
of standard LP analysis. This success allows us to define the error 
between the smooth contour and the real data in terms of this model. In 
equation 6, 



^ =» (1/n) * 



[ S - a*(ti-tO) 

n-1 [ ti 

S 1 

iO [ S 



- b 



I 



ci 



(6) 



is d^t.ined as ti.e mean squared fractional difference between the real 
data and the line which starts at tine '^tO', at position ^b' with slope 
""a'. S^^ is the coefficient (in this particular LP representation) at 
time tl. In this equation, ^a' is the only parameter available for 
minimiziaii che perceptual distance between the smooth line and the real 
data • 
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(7) 



Equation 7 gives ''a' as the optimal slope found by setting the 
derivative of equation 1 (with respect to a) equal to zero, 
minimizes 



Thus, ^a' 
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Altlioup;h the mean «c|uaro(l frnctlonnl orror, fvlvfjn In ef|unUlon 6, 
la a tniccable apGclE Icaclon oC an InCGrpolaUion crlcorlon, Eilmplo 
avoraRing can lead to unaccoptablo flituaciona. IC the real data la well 
approximated by a straiRhc line for a lon^ period but then dlver^ea 
sharply from its previous path, averafilnp; will cause the fJCralRhc line 
to continue well past the point oE diverr^ence. Equation R provides an 
additional error definition for the interpolation. 



[ S - a*( ci - cO ) - b 
n-1 [ ci 



max [ 

i_0 [ S 
[ ci 



(8) 



We define ^ in equation 8 to be the maximum component ofc<. By using 
to compute ^a' and using both << and in the decision criteria shown in 
equation 9, 

Decision criteria (reject if): 

^ > threshold^ or ^> threshold^ (9) 

we can keep both the average and the maximum error within acceptable 
bounds* 

The f ormant/bandwidth representation of the S has four disjoint 
numerical regions: .5>S>0, S = 0, 0>S> -.25 and -.25 > S > -.5. 
This provides the interpolation with an additional decision criterion: 
interpolated pseudo-f ormants and bandwidths never cross region 
boundaries. 

Another issue that will be examined as part of this experiment is 
the allocation of the total acceptable approximation errors among the 
various f ormants and bandwidths. Not all f ormants (or bandwidths) will 
be equally sensitive to errors of interpolation, but without a general 
perceptual model it is difficult to decide what the best division isi A 
series of experiments conducted by Flanagan (summarized in 
Flanagan (1972)) provides a basis for initial allocations to the first 
and second fomant frequencies and bandwidths. Initial allocations to 
the other parameters have been based on informal listening tests- The 
experinenc we are currently conducting will improve these allocation 
assignments, which are crucial to a successful interpolation. 



3.2 Lin<yuistic analysis 

We have implemented one simple model of a "story-view" analysis of 
dialogue text, and we are beginning work on the generation of paragraph 
prosodic contours using simple models of the "story-view" of the texts 
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xl^ our currLculuin. We have not clLroctly purHUod jioinantlc nnalyflln of 
acinccmcog , but wa have increased the QophigtlcatJon of: our uyntacclc 
nnalyois to accompll.sh many of cho f^oals chat would bo derived from 
semantic acudiea. V/o have Lmplemonted a procedure for addinp; on-line 
ganeratlon of audio mesnaRea In the curriculum. 

3» 2* I Story-View and paragraph prosodic contoura 

The DIALOG program generates audio niessapes from a prepared text 
file using multiple voices derived from a single recorded vocabulary. 
It serves as a tool to determine the parameters of a simple model of 
**story-view" prosodies. While there are many differences between 
speakers, or even between different utterances by a single speaker (for 
example, in the phonetic content of utterances ) , the model we are 
investigating specifies some constraints which can be implemented in a 
prosodic generation system such as ours. The constraints which are 
manipulated in this program isolate the pitch, speed and loudness of 
each sentence as being capable of defining separate voices. The pitch 
of a sentence is defined as the maximum fundamental frequency of any 
word in the sentence ; speed is the factor by which the entire sentence 
is speeded or slowed from the neutral sentence assignment ; loudness is 
the factor by which the loudness contour for the entire sentence is 
modified. Sentential prosodic generation procedures are left unchanged 
by manipulation of these constraints in DIALOG, the results of the 
sentential procedures are simply relativized to the above constrains. 
The model implemented in DIALOG indicates some perceptually important 
parameters for determining continuity and discontinuity of discourse and 
is therefore useful^ in developing a "story-view" model. 

We are beginning to implement and test a simple model of paragraph 
scope prosodic contours in the curriculum- This model assigns 
constraints on the sentential prosodic assignment algorithms from 
limited information derived from the (abstract) structure of paragraphs, 
and, syntactic and semantic heuristics based on sentence length and 
explicit semantic markings in the text. The model manipulates the same 
constraints, pitch, speed and loudness, as the DIALOG program. 

3.2.2 Semantic and Syntactic Analysis 

We proposed to add a semantic analysis to the linguistic analysis 
routines which provide the prosodic parameters for intonation synthesis. 
That proposal implied a separation of the semantic and syntactic 
components of language analysis . We have decided instead to pursue the 
improvement and expansion of our syntactic and assignment components to 
include semantic considerations rather than explicitly add semantics to 
our system. This decision is based on results of our early work to 
include semantic components. 

In the process of adding an explicit semantic component to our 
analysis we decided to incorporate local, somewhat ad-hoc uses of 
semantic information into the syntactic analysis to make it more 
sensitive to semantic and lexical facts. Also the assignment of 
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proaodlc parninoUiirM troin tho HynCacflc ntructura ot' n aontonro han boon 
ncl.iuatud to make It mora HonaUive to oortnin nGinnntlc Ceaturoa, Tho 
ayncnccic component handlen pronouna in a mora complex way, doivlfllnp; 
when to consLdcir thorn as empty anaphora and when aa content worda. The 
a^ial^nment La now aenaitlve to theao anaphora and nlao to nomma 
Intonation- Theae adjustments of the syntactic and naalr.nnent 
components have been faLrly effective in providinp; the naturalneaa wo 
expected to obtain from adding?; explicit aemantica. 



3,2,3 On-line p:eneratlon 

We have implemented a procedure this year, called RSPEAK , which 
enables us to generate prosodic contours for sentences which are at 
least partially syntactically analyzed. This procedure largely answers 
Che need for on-line generation of audio with reasonable intonation. 
For generating audio messages for proof summarization, a parameterized 
message, such as " proves that /? holds when If is also true" is 
syntactically analyzed and stored (whereof, y(9 and y are string variable 
place holders for potentially complex phrases). Substitutions for these 
variables are also stored in a partially syntactically analyzed form. 
In the sample parameterized message given here, we could, for example, 
substitute "theorem 31" foroC, "the conditional P implies Q" for/9, and, 
"for all X, X implies r*' forV. In this example, / is itself a 
parameterized message; f'is a string variable, and might be substituted 
by "S of x". 

\^en the message is generated, substitutions for variable strings 
are made and the entire partially analyzed sentence is sent to the 
BSPEAK procedure which efficiently generates prosodic parameters and 
directs the MISS machine to begin playing the sounds. Using the 
substitutions suggested above, the sample message would read 

"theorem 31 proves that the conditional P, implies holds when 

F 

for all X, X implies S^ of. x is also true." 



This sentence would require a relatively large amount of time (on the 
order of several minutes with a heavily loaded system) for syntactic 
analysis if it wexe to be parsed as such, but since each smaller string 
was analyzed at the time of storage (some with variables) no additional 
syntactic analysis is necessary. Since BSPEAK uses a context free 
grammar to generate prosodic parameters, it can infer reasonable 
parameters for rhe un-analyzed portions of the message in light of the 
structure of rhe message. Therefore, even if one or more of the 
variables in ^he example above had not been pre-stored with a syntactic 
analysis, BSl'EAK would still have been able to generate a reasonable set 
of prosodic parameters for the sentence. 
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.3.-1 St:tifly af. prn^ind I n g un VI t:y 

V/o have utlllj'.eul two mennfi oC avnluntJn^ the (innlity oP 
pro^Kul Icaily mocllElod word-concaconntod utcornncea . Wo conductad an 
o;<par Iment , vSnndfirn et nl (1078), In wlil ch twenty nnlvo auhJectM rntad 
olj!;ht varlacLona of cwnnty aoncencofl. The vnrlntlona oC the aantoncoa 
Ifjolatod the pitch, duration and loudnega componentn of our proaod Ic 
model. The other means of evaluatLnp; proaodlc quality we used IvS the 
modeling of atudent use of audio In the courae, ''Introduction to Lon;lcJ' 

3» 3* 1 RatlnR experiment 

LP encoding separates speech into four component contours: pitch, 
duration (rate), loudness, and spectrum. There is a generally 
recognized hierarchy of intonational effects, in which pitch contour has 
first place, duration next, loudness third. Segmental information is 
outside the hierarchy. The purpose of this study was to discover, for 
the current MISS system, the relative contributions to unnaturalness 
made by these factors, in order to appropriately direct efforts to 
improve the quality of word concatenated speech. 

Twenty sentences were chosen from a collection of sentences used in 
a course taught at the Institute. They had all been recorded by the 
speaker who also recorded the lexical words. The sentences selected 
contained a full range of English phonemes, and represented a variety of 
syntactic types: declaratives, questions, and imperative sentences. 
Word boundaries in each recorded sentence were marked as accurately as 
could be determined by graphic and perceptual means. The sentence texts 
were parsed by MISS's parser, and the word durations, amplitudes, and 
pitch contours were calculated. The recorded sentence was then modified 
to have some artificial (MISS generated) characteristics while retaining 
some of its original character. 

Eight different treatments of each sentence were prepared. 

1. The recorded LPC processed sentence (recorded). 

2. The recorded sentence with MISS's amplitude modification 
replacing the original (artificial amplitude). 

3. The recorded sentence with MISS's duration assignments 
replacing the original word durations (artificial duration). 

4. The recorded sentence with MISS's pitch contour replacing 
the original pitch contour (artificial pitch). 

5. The recorded sentence with spectral coefficients from 
individual words as stored in the English dictionary 
replacing the original coefficients (artificial 
coefficients ) . 

6. The sentence as produced by lexical word concatenation with 
prosodic modification of pitch , duration and amplitude 
(artificial prosody). 

7. The recorded sentence with artificial pitch, artificial 
duration, artificial amplitude, and artificial coefficients 
replacing the originals (artificial composite). 

8. The recorded sentence with a previous version of MISS's 
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plroh contour roplnnlnK tho original p.I|:ch nontuur (olrl 
p Uch) . 

The auhjootn yato fAvHt j^Lven a doFinitlon of: nntuiralnona and v/oro 
told to rata tho nantoncoa thoy worn about to haar From I Co 9, with 0 
and 10 nB ovorfiow catofVH" loa • Wu choHu to nau nalvo Eudr|onta t:o avoid 
tlia posaibllUy tliat provloua oxpoauro to auch apoonh mlrjit hava 
InCluencod the oxperlencod Uatoner'a notion oC naturalneaa. 

The means o£ each Ureatmonc (avoraRod over aoncencoa and n^ibjocta) 
showed aignif Leant and Intortssc Lnp; diEEeroncea, The mean for each 
treatment was tested against the other means uainR a t-teat. The t-teat 
showed that while the recorded version did not diEEer signiElcanUly Erom 
the artificial amplitude version and that the artiEicial prosody 
sentences was neither better nor worse than the artiEicial compoalto 
sentences, all the other diEEerences in the means were atatlatically 
BipniEicant below the .001 level, so that we can order the mean scores 
in six groups as shown in Figure 7. 



Group 


1 Treatment: 


Mean 


S.D. 


1. 


recorded 
artiEicial 


amplitude 


6.705 
6.565 


1.889 
1.812 


2. 


artificial 


duration 


5.443 


2.102 


3. 


artiEicial 


pitch 


4.540 


2.098 


4. 


old pitch 




4.113 


2.093 


5. 


artiEicial 


coefficients 


2.320 


1.742 


6. 


artiEicial 
artificial 


composite 
prosody 


1.840 
1.835 


1.372 
1.420 



Figure 7. Means and Standard Deviation of the Naturalness Scores. 



These preEerences indicate that, as expected, the recorded 
sentences are r.ore natural than the artificial prosody sentences. More 
inportanti-r, the significance of the preference ordering shows which 
artificial elements contribute the most to the perceived unnaturalness 
of artificial prosody sentences.' Substituting artificial spectral 
coefficients for the original sentence coefficients makes the greatest 
dlEEerence in naturalness of any of the single changes. 



3,3,2 Analysis oE student preEerences 

The course, "Introduction to Logic", is given three times per year 
Eor Eive units oE university credit and taken by approximately 250 
StanEord students each year. To test the eEf ectiveness of audio in our 
curriculum, we offered a choice of display-only and display-with-audio 
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w,tjuuri|t I Of) by f',lvhiyi, tU:iu|BiU:H in th^ \<m\.Q VMW^^ t^irlun" \:tuu^r^\^H\ 
phr^uioH ([.H-inofl^O or prDHrul loo Uy n^MUirato^l t^anc^ncBs:} (P-mo^le) tUa 
niuILn VHrtii-on of r:ho courne and nllowiny^ tUm to nna cha (llt^play-only 
vurnlnn If l^h^y wUihtidi aiMnlPntH waro annourn^ad (.0 cvy hnlh varHinnJi 
ami t:n nwlnoli hrnwoun VMrnlnna whnnnvar l-ht^y fnnnd II' luilpfuU Tluvy 
wnru InformtHl char: I hui r K<-fHUu^ In no way ilH|uMulafl on wh trh vor^sion of 
uho convBci cluiy luit^^d or Uou oft^on Mu^y tiwl t-ohi^il. lly sent UH^ i oa l ly 
iniia«iirln« and inodoltrm fho rolarlvii praforond^M of l«H-innda vt^ryiin 
dUplay, and P-modt3 varana dlaplay-onlv (OPY-niodv^i MMl-lniaCa ih^ 

rolatlvn nafuralnoaa of [Mo two muhui of MTS:1 aynnh^al-^, Thraa 
tixptirlmrtnl:a wero nnadne.tod fo colloal- dal^a on nlnilont pfoforonpe^t Tn 
tho tlrat oKporLmont ww conparod only I.S and DPY inodaa; In Iha atiVnnil, 
wo compared and P modoa via rolaCiVQ pr<if tu^tinoo for OPV Piodo; In 
cho third, we allowotl a froo choice of t:hro(j inodoa, 

r.n thfl flrat cxperlmanl:, ntudonta waro aplU Into two ^roopa, one 
l)OKlnnlnK la l.S-modo, tha otlior In DPY-modo for an lul.Ual MMRm^nt of 
cho conrf3e% Both gronpa woro chon awltchad to tho aUcirnato niodo for a 
second HO.^mont of the couraB, For thoao Inlt lal aaRinanta, tha atndanl; 
had no choice over which mode (DPY or T.S) wan preaontod In 'which loaaon* 
Doth groups wore free to choone either mode at: each 'Mo^i;-ln** from than 
on • 

The results of the Eirat experiment, whown In Flgura 8, indlcatad 
that audio was not an overwhelming Eavorite; only about half the log-inn 
for the portion of the course in which atudenta had a choice were audio 
log-ins. The percentage of audio log-ina actually begins higher but 
decreases as students progress through the course. The poccontngo of 
audio chosen was not particularly affected by the amount of forced 
exposure to audio (3 lessons in Winter '77, 1 lesson in Spring '77) they 
had. 

Term Lessons with Wi th Log-ins Percent 
no choice choice (choice) in^ audio 

Winter '77 1-6 7-18 1278 ^9 

Spring '77 1-2 3-20 2742 48 



Figure 8. Results of the first audio/display experiment. 

The second experiment was very similar to the first, except that 
half of each of the two groups was using LS-mode when they received 
audio, while the other half was using P-mode. The results for this 
experiment, shown in Figure 9, are somewhat clouded by the fact that in 
the autumn quarter there were errors in . the course software and 
curricuiuD which affected only the audio versions. By the winter 
quarter these problems had been mostly corrected for P-node, but not for 
LS-mode (hence the poor showing of LS-mode versus P-mode for that 
quarter). The results of this experiment indicate that P-mode is as 
acceptable to students as LS-mode, despite the difference in quality. A 

33 



School 




Lessons 


Logins 


Percent 


Cotnraent 


Term 




Forced 


Choice 


w/ choice 


of Lof^lns 




Spring 


'77 


(none) 


21-29 


889 


30 LS-mode 


See 










763 


26 P-mode 


Figure 8 


Au tumn 


'77 


1-2 


3-29 


1759 


20 LS-mode 


Errors In 










1941 


22 P-mode 


audio curric. 


Winter 


'78 


1-2 


3-29 


1601 


17 LS-mode 


Errors f Ixed 










1524 


34 P-node 


In P-mode only 



Figure 9. Results from LS-mode and P-mode vs. DPY-mode experiment. 

significant reason why students favored DPY-mode can be attributed to 
the errors which existed In the audio version of the curriculum. 
Results for Spring '78 and Autumn '78 are currently being analyzed. 

A third experiment was performed with the first four lessons 
(approximately four hours) of the logic course over a two week period In 
autumn 1977, and repeated In 1978. For the four-lesson course, 64 
Stanford freshmen were given a three-way choice: DPY-mode, LS-mode and 
P-mode, with a brief explanation of what the modes Involved. These 
students constitute the only group so far to have a choice between LS- 
mode and P-modes In audio. In addition to the option of choosing DPY- 
mode. The results from these two years are summarized In Figure 10. The 
percentage of audio log-Ins remained fairly constant In both years, but 
the preference for audio, the number of students who logged-ln more 
times in one of the audio modes than In display, was greater In 1978. 
Similarly, the preference for LS-mode over P-mode, among those whose 
preference was for audio, was also considerably higher In 1978. F-mode 
compared respectably In a head-to-head confrontation with LS-mode In 
this experiment. 



Number of Students 
School total Percent In Preferred Preferred (Audio) 

year lofi^-lns Audio Audio Dpy Mixed LS- P- Mixed 

Mode 

'77 402 62 25 33 6 18 7 

'78 357 69 43 16 5 36 5 2 



Figure 10. Results from three-way free choice experiment. 

Another experiment Is presently under way (Winter '79) in which the 
students' tendency to use audio is compared to the extent of previous, 
forced exposure to audio. For this experiment students have a choice 
only between P-mode and DPY-mode. The students are divided into four 
groups : 
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1) short exposure to P-mode then lonp, exposure to 
DPY-mode; 



2) short exposure to P-mode then medium exposure 
to DPY-mode; 

3) short exposure to DPY-node then long exposure 
to P-mode; 

4) short exposure to DPY-mode then medium exposure 
to P-mode • 

By **short exposure** we mean approximately a half hour; ''medium exposure" 
is about three hours, and, long exposure is nine hours. These times are 
mean student times from previous quarters of the course for the lessons 
where the exposure to P-mode or DPY-mode was f orced« This experiment 
will be repeated for Spring '79 using better prosodic techniques that 
have been developed during this grant. 
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4 Schedule oE Proposed Research 



4.1 InEorcial Mathematical Procedures 

The schedule given here for completing the research proposed under 
the informal mathematical procedures portion of the original proposal is 
a more explicit version of the schedule on page 82 of the original 
proposal. 

We will continue the development of natural inference procedures 
and will extend the implementation of the interactive theorem prover to 
handle complex proof procedures. In particular, the basic Pfo^^ures 
used in the set theoretic version of ESTABLISH will be refined and 
extended to enhance its performance on more difficult inferences in set 
theory. Complex reduction sequences (proof strategies) will be 
implemented. The finite structure mechanisms including the dialog 
procedures will be extended and implemented in the logic course. 

Sophisticated mechanisms for providing guidance in the set theory 
course based upon suggesting applicable reductions or strategies will be 
extended and refined. These mechanisms will also be implemented for the 
logic course. Guidance based upon re-execution of stored proofs will be 
implemented in the set theory course to provide guidance for those cases 
beyond the capability of the other guidance procedures. 

A major focus of our work in the third year will be the evaluation 
of the interactive theorem prover. powerful inference procedures and 
guidance mechanisms. For more detailed descriptions of the ev^J^JJ^"'^ 
procedures, see the appropriate sub-sections of Section 2. In addition, 
work will continue on streamlining and improving the course driver, 
EXCHECK. to facilitate the student's interaction with the computer. 
Dissemination of research results will be carried on through continued 
publications, presentations and conferences. 



4.2 Audio and Prosodic Features 

The schedule for completing research originally proposed for the 
third year of the grant is substantially the same as that given on page 
83 of the original technical proposal. We will implement concatenation 
algorithms and additional prosodic analysis and Re"^"'^^"" 
We will continue to study the quality of. the synthesized speech as 
described on pages 65-66 of the original proposal. 

In the next year of this grant we will complete the specification 
of desipn criteria for the MINI-MISS, including data compression and 
interpolation algorithms. Important specification information will be 
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derived from the results of an experiment on Interpolation (see Section 
3.1.5 above) to be completed in this third year. We will construct the 
MINI-MISS machine satisfying the design criteria and test it in the 
field. We will also continue to implement new LPC techniques (see pa^es 
58-59 of the original proposal) and work will continue on improving the 
algorithm for digitizing individual words (see page 63 of the original 
proposal) . 

In addition to the experiment on interpolation, evaluations of the 
quality of MISS speech production and prosodic assignments will 
continue. Dissemination of the results of our research will be carried 
on through continued publications, presentations and conferences. 
Especially important will be the presentation of our findings on data 
rate reduction. 
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Appendix jA 
Dissemination of Research 



An important part of the work carried out under this grant has been 
the dissemination of the results and conclusions of the research done 
under support of the grant. Papers and presentations have been given at 
national and regional conferences concerning computers (ACM), education 
(CECC and ADCIS) , and, speech research (ASA). Below we list the 
publications and presentations of the principal investigator, Patrick 
Suppes, and the publications and presentations of the other staff 
members. Some of these publications and presentations are also 
referenced in the technical sections of this report and appear in the 
report bibliography. They are included here for completeness. 



Published works of Patrick Suppes 

1977, Learning theory for probabilistic automata and register machines, 
with applications to educational research. H. Spada & W. F. Kerapf 
(Eds. ) , Structural models of thinking and learning (Proceedings 
of the 7th IPN-3ympo3ium on Formalized Theories of Thinking 
and Learning and their Implications for Science Instruction). 
Bern; Hans Huber Publishers, Pp. 57-79. 

1977, A survey of contemporary learning theories. In R. E. Butts & 

J. Hintikka (Eds,), Foundational problems in the special sciences 
(Part two of the Proceedings of the Fifth International Congress of 
Logic, Methodology and Philosophy of Science, London, Ontario, 
Canada, 1975). Dordrecht: Reidel, Pp. 217-239. 

1977, The future of computers in education. In, Computers and the 
learning society . (Hearings before the Subcommittee on Domestic and 
International Scientific Planning, Analysis and Cooperation,^ of the 
Committee on Science and Technology, U.S. House of Representatives, 
Ninety-fifth Congress, First Session, . October 4, 6, 12, 13, 18, 

and 27, [No. 47].) Washington: U.S. Government Printing Office, 
Pp, 5^3-569. 

1978, Impact of research on education: Some case studies . Washington, 
D.C.: National Academy of Education, 672 pp. 

1978, Performance models of undergraduate students on computer-assisted 
instruction in elementary logic. Instructional Science , _7, 
15-35 (with I. Larsen and L. Z. Markosian). 

1978, Steps toward a variable-free semantics of attributive adjectives, 
possessives, and intensifying adverbs. In K. E. Nelson (Ed.), 
Children^'s language (Vol. 1). New York: Gardner Press, 
Pp. 81-115 (with E, Macken). 

1978, l^e role of global psychological models in instructional technology. 
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In R. Glaser (EdO, Advances in Instructional psycholofiiy (Vol. 1). 
Hillsdale, N. J.: Erlbaum, Pp.229--259 (v/ith E. Macken & M. Zanotti). 

1978, The historical path from research and development to operational use 
of CAT. Educational Technology , (April) 9-12 (with Macken). 

1978, La informatica en la educacion. Nonotza , Revist a de Difusion 
Cientif ica Tecnolo^ica ^ Cultural 13 , IBM de Mexico. 

1979, Current trends in computer-assisted instruction. In M. C. Yovits 
(Ed.), Advances in Computers (Vol. 18), Academic Press, New York. 

1979, Variable-free semantics for negations with prosodic variation. In 
E. Saarinen, R. Hilpinen, I. Niiniluoto, & M* P. Hintikka (Eds.)» 
Essays in honour of Jaakko Hintikka . Dordrecht: Reidel, Pp. 49-59. 



Unpublished works of Patrick Suppes 
1977 

Some global models of learning and performance, neeting of 
American Psychological Association, San Francisco 

Computer-assisted Instruction In University-level Mathematics, 
Department of Mathematics Fall Colloquium Series, 
San Francisco State University 

Computers in Education, First Western Educational Computing 
Conference, San Francisco 

Computer-assisted Instruction, Tata Institute for Fundamental 
Research, Bombay, India 

1978 

Research on Computerized Instruction, School of Edacation, 
Stanford University 

The Future of Computer-assisted Instruction, Incernational 
Symposium of Informatics, Mexico City 

Variable-free Semantics for Natural Language, School of Social 
Sciences of Calif c::nia, Irvine 

Past, Present and Future Educational Technologies, Third World 
Mathematics Conference, Khartoum, Sudan 

Some Remarks on the Semantics of Natural Language, Language, 
Mind, and Brain National Interdisciplinary S5rTnposium, 
Gainesville, Florida 

Variable-free Semantics for Natural Language, Department of 
Linguistics, University of Texas, Austin, Texas 
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August 27 
November 9 

November 19 
December 12 

January 19 
January 26 
February 22 
March 6 
April 7 

April 13 



April 26 



Future Trends in Computer Assisted Instruction,' ComputGr 
Science Colloquium, University of California, Berkeley 



May 6 Computer-assisted Instruction in Community Colleges, Northern 

California Community Colle^>e Computer Consortium, Sierra 
College, Rocklin, California 

May 20 Computers and Productivity in Education, Campus Conference, 

Stanford Alumni Association 

May 25 Computer-assisted Instruction at the University Level, 

California Polytechnic State University, San Luis Obispo 

July 13 Logic and Set Theory in Schools, Australian Logic Teachers 

Association National Conference and Australasian Association 
for Logic Annual Conference, The University of Queensland 

July 13 Analysis of Student Trajectories in CAI Courses, Kelvin Grove 

College of Advanced Education, Brisbane, Australia 

July 14 Computer-based Logic Instruction, A.A.L. /A«L.T.A« , University 

of Queensland, St. Lucia, Brisbane, Australia 

July 14 The Semantics of Natural Language in Logic Courses, 

A.A.L./A.L.T.A. , University of Queensland 

July 14 Computer-assisted Instruction in the Schools and Colleges 

in the United States, Kelvin Grove College of . Advanced 
Education, Brisbane, Australia 

July 21 Variable-free Semantics for Natural Language, Department of 

Philosophy, The Australian National University, Canberra 

August 31 Inductive Logic and Its Applications. International 
Institute of Philosophy, Dusseldorf, Germany 

September 23 Educational Technology and the Future of Education, Teachers 
College, Columbia University, New York 

November 28 Transportability of Curriculums, Digital Users Group, DECUS, 
U.S. Fall Symposium, San Francisco 

December 1 Panel Discussion: Reactions to "Values and Evaluation," 
presented by Michael Scriven. Far Western Philosophy of 
Education Society, 27th Annual Meeting, Palo Alto, California 

December 1 Teaching Logic and Set Theory by Computer, Logic Colloquium, 
Group in Logic and the Methodology of Science, University of 
California, Berkeley 

December 6 Panel Discussion: Conputer-based Courses. ACM meeting, 
Washington, D.C. 

1979 47 
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1977, 



1978, 



1978, 



1978, 



February 2V .ucu« of Computers i„^duca.lon= T.e^s Le^.u.a. 

Inscrucdonal Systems, San Diego, Ca. 

Published H2l!a °1 S£!!K 
Blaine, and K. S.lth, "I-^Jl^^f^-^^^" = ^flZllli^/"' 

Computing Machinery, Seattle. 

Blaine, L., A. Levlne K. '•^tf/sltSsk'SJSSlSi 
problems In Jmplementlna l' - "l7,""^„,"\- i7;riii^niSS. 
^I^hsSi is. SSf^^^^i^^ TSVmlJlS: Studies 
Ir^S"sicranLe°n;e^!'strfr.d%nlvetsUy. 

„a.^a, «. and T Pettlt ^-^^^^J^^m^f^'^s 
irrniliorars^irn^rstr^^id university. 

o J T>,o MTSS soeech synthesis, system^ 
Levine, A. and W R. f^"/-^;,^,f^,^gk^n,-^n^udies 
Technical Report No. 299, ^"f ..^g_„itv. 
in the Social Sciences, Stanford Uni/ersity. 

^ U R Sanders, "The MISS speech synthesis system 
U^^guMished wo^ 

1977 

P,„t western Edooatlonal Computing Conference, San Franclseo. 

Blaine, L. "The Excheck System". 

society Of Meeting, Miami. Florida. 

Levine A., "Variation in sentence duration due to lexical content, 
s^mankc emphasis, and text structure . 

1978 ■ 

western Educational Computing Conference, Anaheim. 
Blaise L. and J. McDonald, "Interactive processing in the EXCHECK 
system of natural mathematical reasoning. 
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Blaine, L., M. Davis, S. Lindstrom and R. LaddaRa, "Courses Caught 
using Che EXCHECK system." 

Markosian, L. Z., R. Sanders, H, Seropian and J. van Canpen, 
"A language instruction program for Armenian using 
computer-synthesized speech," 

McDonald, J., L. Blaine, J. Marlowe and R* Roberts, "The EXCHECK 
system," 

Laddaga R., J. Marlowe, T. Pettit, and L, Markosian, "The VOCAL 
system," 

Sanders, W. R. , A. Levine and R. Laddaga, "The MISS audio system." 

Acoustic Society of America , 96th Meeting, Honolulu, Hawaii, 

Levine, A. "Musical intervals in the elaboration of underlying tonal 
melodies"» 

W. R. Sanders, C. M. Gramlich and A. Levine, "The sensitivity of LPC 

synthesized speech to the imposition of artificial pitch, duration, 
loudness and spectral contours". 

1979 

Computer-Based Education; Mission of the Future 
February, 1979 meeting at San Diego of the 
Association for the Development of Computer-Based Instructional Systems 

W. R. Sanders, A. Levine and R* Laddaga, "The Use of MISS in 
Computer-Based Instruction". 

Other presentations 
Levine, A. Six lectures on "Computational linguistics, speech synthesis 
and CAI," presented at Bar Ilan University , Ben Gurion University, 
Hebrew University, University of Tel Aviv, the Weitzman Institute; 
Israel. 
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Appendix 
Sample Proof 



The following is a sample proof in an elementary set theoretic system 
which demonstrates features of the interactive heuristic natural deduction 
theorem prover in addition to those illustrated in Section 2.2. A slightly 
different method of display has been selected by the student, so that only 
the immediate goal is printed in any frame. All student input is 
underlined (the continuation of the command being supplied by the program). 
The horizontal lines separate different frames (previous material 
disappears from the screen). 



Derive: -^(A 6 B and B € A) 
*reduce 

IJhich proof procedure? (negation) *reductio ad absurdum 
Formula to be contradicted (formula) ^theorem *reRu larity 

If A is nonempty then (3 x) [x 6 A & (V/ y)(y € x y ^ A)] 

Specify for A? *{A,B} 

[The student decides to use a proof by reductio ad absurdum for 
this derivation. (This is not the default procedure.) The 
program asks for a formula to be used as one half of the 
contradiction; the negation of the given formula will be the 
other half. An option exists which allows the student to defer 
proving the contradiction. Since the student has chosen to 
contradict a theorem, only the negation of the theorem needs to 
be proved.] 



Show: (G2) -.(<A,B} ^ 0 (3 x) [x e <A,B} & (\/ y) (y 6 x & y /{A,B»]) 

Assume (1) A 6 B and B 6 A 

^redu ce 

Which proof procedure? (negation) *full negation 

[The option "negation" takes a negated (complex) goal and 
produces a non-negated formula, using elementary logic. For 
example, from -n(P -^0) we get (P & -*Q) . The option "full 
negation" carries out this procedure recursively throughout the 
formula . ] 
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Show: (G3) iA,B} ^ 0 & (Vx) [x 6. {A,B} --^^ 13 y) (y 6 x & y {A,B})] 
Assume (1) A ^ B and B ^ A 



^redu ce 

IJhlch proof procedure? (conjun' tion) ^^conjunction 

[The typing of the ESC k^ay alone (represented as $^) at any point 
selects the default (listed in parentheses).] 



Assune (1) A B and B £: A 

^esta blish 

(2)*::G{A,B} t^O 

[V7ith Establish, the student derives the first conjunct. Typing 
a control-G (^G ) selects the current goal for the formula. 
When the inference to the current goal is accepted, the program 
goes on to the next goal which is, in this case, the second of 
the tvo rr>njuncts from the goal (G3).] 



Show: (G5) (V'x) [x {A, B} (3 y) (y x & y €i {A,B}) ] 

Assume (1) A B and B <Sr A 

^redu ce 

IThich proof procedure? (universal) *_$universal 
Which variable do you want for x? (x) *^x 

[The student has the choice of variable to use in the goal.] 



Show: (G6) x ^ {A, B} (3 y) (y 6 x & y 6 {A,B}) 

Assune (1) A B and B 6 A 

^redu ce 

IThich proof procedure? (conditional) ^^conditional 



Show: (G4) 
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Show: (G7) (a y)(y € x & y € {A,B}) 



Assume (1) A € B and B € A 
Assume (3) x £ {A,B} 

^redu ce 

Which proof procedure? (existential) ^cases 

Case 1 Assumption *x A 

Conclusion *_22.(3 y) (y € x & y € {A,B» 

Case 2 Assumption *x B 

Conclusion y) (y € x & y 6 {A,B» 

[The instance of y that would satisfy the goal depends upon what 
X is. From (3), x may be either A or B- Given this, the 
student decides to do a proof by cases: showing that (G7) 
follows no matter whether x is A or x is B.] 



Show: (08) X A x B 

Assume (1) A € B and B 6 A 
Assume (3) x £ {A,B} 

*3 esta blish 

(4)^Gx A V X = B 

[The student must first prove the disjunction that justifies 
breaking the proof into cases.] 



Case 1: 

Show: (G9) (3 y) (y 6 x & y e {A,B}) 

Assume (1) A 6 B and B € A 
Assume (3) x € {A,B} 
Assume (5) x =» A 

*r edu ce 

Which proof procedure? (existential) ^^existential 
Which variable do you use for y? (y) *B^ 

[In the first case, the student wants to show that if x equals 
A, then the conclusion follows. The student selects the proper 
instantiation of the goal to work for, using B as the required 
term. ] 
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Case 1 : 

Show: (GIO) B £ X & B £ <A,B} 

Assume (1) A 65 B and B 6; A 
Assume (3) x €<A,B} 
Assume (5) x » A 

*1,5 est ablish 

(6) *^GB e X & B 65 <A,B} 
6 EG (7) O y) (y € X & y <A,B}) 

[The goal is now an obvious truth of set theory, which is proven 
by ESTABLISH. Once the student proves (6), the goaling 
machinery can use the inverse of the reduction procedure for 
existentials, already specified by the student, to satisfy the 
previous goal. This finishes the first case, where x is A.] 



Case 2: 

Show; (Gl 1) O y) (y e X & y 6 <A,B}) 

Assume (1) A e B and B & A 
Assume (3) x {A,B} 
Assume (8) x » B 

*redu ce 

Iftiich proof procedure? (existential) *$_existential Term *A 

[Once the first case is finished, the student goes on to the 
second, where x = B. That case is dealt with in a similar 
fashion in succeeding frames.] 



Case 2: 

Show; (G12) A e x & A C <A,B} 

Assume (1) A & B and B 45 A 
Assume (3) x c {A,B} 
Assume (8) x =» B 

*1,7 esta blish 

(9) *^GA 6 X & A ^ <A,B} 
8 EG (10) O y)(y e X & y 6: <A,B}) 
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Derive: -^(A 6 B and B € A) 



Assume (1) A £ B and B 6 A 
Establish 

(2) {A,B}7fO 
Assume (3) x 6 {A,B} 
3 Establish 

(4) X = A V X = B 
6 EG (7) (3y)(y € X & y e {A,B}) 
9 EG (10) (3y)(y e X & y 6 {A,B}) 
4,7,10 Cases 

(11) (3y)(y € X & y 6 {A,B}) 
3,11 CP (12) X 6 {A,B} -^(3 y)(y € x & y i5 {A,B}) 
12 UG (13) (\/x)tx 6 {A,B} (3 y)(y € X & y 6 {A,B})] 
2,13 FC (14) {A,B} ^ 0 Si (b^x)[x 6 {A,B} ~t (3 y)(y € x & y ^ {A,B})] 
14 Full Negation 

(15) ({A,B} ^ 0 C3 x)[x ^ {A,B} & (\/y)(y « X & y {A,B})]) 
1,15 Contradiction using Th. Regularity 

(16) n(A e B and B € A) 

Use QED to finish your proof 

[Since (G12) is the last unsatisfied goal, the goal machinery 
can use the inverses of the reduction procedures, already 
specified by the student, to finish the proof. This is done by 
generating formulas to satisfy those goals all of whose sub- 
goals have already been satisfied. Thus, once (9) is inferred 
using existential generalization, (10) is inferred by the proof 
by cases. Lines (11) through (14) are inferred using the proof 
procedures: Conditional, Universal, Conjunction and Full 
Negation. Finally, the overall goal is derived in a reductio 
ad absurdum from the contradiction between (14) and the theorem 
of regularity. ] 
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Appendix Q 



Reduction Proof Procedures 



ons 



The following is a partial list of the proof procedure reducti 
available in the interactive heuristic natural deduction theorem prover. 
Section 2.2, above, contains a discussion of proof reductions. The 
expression to the left of an ^|=>' denotes the allowed assumptions and the 



upper line 


reads: 


prove A C from +. The whole schema reads: reduce the 


problem *prove A 


C from to the subproblem *prove C from +,A'. 
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5 !=•. (V x)( A[x] C[x] ) usually definition or theorem 

In p ?<^-t i cul ar, identities and memberships can be reduced by 
appiijiri;.] tha appt-'Dpriate theorem or definitionj e.g.; 

5 |=> A e /oB 



5 |=> (V x)( X e pB x.c B ) theorem of poujerset 
5 ]=> A c D 
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